Hades: a Distributed System for Dependable Hard Real-time Applications Built from Cots Components
نویسندگان
چکیده
Most dependable embedded real-time systems designed in the past have been specialized to meet the speciic requirements of the application domain for which they were targeted, leading to innexible and often hardware-intensive solutions that are costly to design and maintain. This paper is devoted to the description of Hades, a software infrastructure to develop and execute distributed dependable hard real-time applications. For economic and maintainability reasons, this infrastructure is based entirely on Commercial OO-The-Shelf (COTS) components (motherboard, network and real-time kernel). The Hades run-time support, built as a middleware layer on top of an unmodiied COTS real-time kernel, provides basic real-time distributed fault-tolerant services (clock synchronization, reliable multicast, failure detection), as well as error detection, error recovery and fault treatment mechanisms. The error detection mechanisms ensure that every computer is fail-silent, allowing to design low-cost error recovery mechanisms. The paper is devoted to the description of hades and to a comprehensive evaluation of its performance, the latter allowing to apprehend the limits that can be reached when using COTS components to execute applications with hard real-time and dependability requirements. Hades: un systtme distribuu pour applications temps-rrel strict ssres de fonctionnement construites partir de composants COTS RRsumm : La plupart des systtmes embarquus temps-rrel connus dans le passs l'ont tt pour satisfaire aux exigences particuliires du domaine d'application cible. Ceci a menn des solutions non exibles ayant souvent recourt de maniire intensive du mattriel dddii qui est cooteux concevoir et maintenir. Cet article est consacrr la description de Hades, une infrastructure logicielle pour ddvelopper et exxcuter des applications distribuues temps-rrel strict ssres de fonctionnement. Pour des raisons de coot d'achat et maintenabilitt, cette infrastructure est entiirement basse sur des composants COTS (acronyme de Commercial OO-The-Shelf *), savoir carte mmre, rrseau et noyau temps-rrel. Le support d'exxcution de Hades, construit comme un intergiciel (middleware) au dessus d'un noyau temps-rrel COTS non modiii, fournit des services distribuus de base pour la tollrance aux fautes (synchronisation d'horloges, diiusion able, ddtection d'arrrts), ainsi que des mmcanismes de ddtection d'erreurs, de recouvrement d'erreurs et de traitement de fautes. Les mmcanismes de ddtection d'erreurs assurent que chaque calculateur est silence sur ddfaillance, ce qui permet de concevoir des mmcanismes de recouvrement d'erreurs faible coot. Aprrs la description de Hades, nous prrsentons une valuation exhaustive de ses performances, ce qui permet d'apprrhender les limites qui peuvent tre atteintes en utilisant des composants COTS pour exxcuter des applications avec …
منابع مشابه
Experimental Evaluation of the Fail-Silent Behavior of a Distributed Real-Time Run-Time Support Built from COTS Components
Mainly for economic and maintainability reasons, more and more dependable real-time systems are built from Commercial Off-The-Shelf (COTS) components. To build these systems, a commonly-used assumption is that computers are fail-silent. The goal of our work is to determine the coverage of the fail-silence assumption for computers executing a real-time run-time support built exclusively from COT...
متن کاملAre COTS Suitable for Building Distributed Fault-Tolerant Hard Real-Time Systems?
For economic reasons, a new trend in the development of distributed hard real-time systems is to rely on the use of CommercialO -The-Shelf (cots) hardware and operating systems. As such systems often support critical applications, they must comply with stringent realtime and fault-tolerance requirements. The use of cots components in distributed critical systems is subject to two fundamental qu...
متن کاملExperimental Evaluation of the Fail-silent Behavior of a Distributed Real-time Run-time Support Built from Cots Components P. Chevochot, I. Puaut
Mainly for economic and maintainability reasons, more and more dependable real-time systems are built from Commercial OO-The-Shelf (COTS) components. To build these systems, a commonly-used assumption is that computers are fail-silent. The goal of our work is to determine how far it is possible to reach the fail-silence assumption for computers executing a real-time run-time support built exclu...
متن کاملA Flexible Run-time Support for Distributed Dependable Hard Real-time Applications
Typically, most distributed, dependable, real-time systems designed in the past can only meet the particular requirements of the application domain to which they were targeted. This approach led to specific, non-flexible, dedicated and non-reusable solutions, often based on specialized hardware. This paper presents an alternative approach where a flexible run-time support for distributed depend...
متن کاملThe DEAR-COTS Replication Framework
The DEAR-COTS (Distributed Embedded ARchitecture using Commercial Off-The-Shelf components) architecture provides a COTS-based framework to execute reliable hard real-time applications. To ease the task of building reliable realtime applications on top of COTS components, a simple and transparent programming model is provided, in order that programmers abstract from the low-level implementation...
متن کامل